Size-aware Sharding For Improving Tail Latencies in In-memory Key-value Stores

نویسندگان

  • Diego Didona
  • Willy Zwaenepoel
چکیده

This paper introduces the concept of size-aware sharding to improve tail latencies for in-memory key-value stores, and describes its implementation in the Minos key-value store. Tail latencies are crucial in distributed applications with high fan-out ratios, because overall response time is determined by the slowest response. Size-aware sharding distributes requests for keys to cores according to the size of the item associated with the key. In particular, requests for small and large items are sent to disjoint subsets of cores. Size-aware sharding improves tail latencies by avoiding head-of-line blocking, in which a request for a small item gets queued behind a request for a large item. Alternative size-unaware approaches to sharding, such as keyhash-based sharding, request dispatching and stealing do not avoid head-of-line blocking, and therefore exhibit worse tail latencies. The challenge in implementing size-aware sharding is to maintain high throughput by avoiding the cost of software dispatching and by achieving load balancing between different cores. Minos uses hardware dispatch for all requests for small items, which form the very large majority of all requests. It achieves load balancing by adapting the number of cores handling requests for small and large items to their relative presence in the workload. We compare Minos to three state-of-the-art designs of in-memory KV stores. Compared to its closest competitor, Minos achieves a 99th percentile latency that is up to two orders of magnitude lower. Put differently, for a given value for the 99th percentile latency equal to 10 times the mean service time, Minos achieves a throughput that is up to 7.4 times higher.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Skyler: Dynamic, Workload-Aware Data Sharding across Multiple Data Centres

Popular online services such as Facebook and Twitter use multiple data centers (DCs) to service globally distributed user requests with the lowest possible latency. Existing distributed data stores either fully replicate or statically shard data across DCs. Full replication limits scalability and static sharding fixes data to a single sharding policy, unable to adapt with user and popularity sh...

متن کامل

A Low-Power Packet Memory Architecture with a Latency-Aware Packet Mapping Method

A packet memory stores packets in internet routers and it requires typically RTT ×C for the buffer space, e.g. several GBytes, where RTT is an average round-trip time of a TCP flow and C is the bandwidth of the router’s output link. It is implemented with DRAM parts which are accessed in parallel to achieve required bandwidth. They consume significant power in a router whose scalability is heav...

متن کامل

Warp: Lightweight Multi-Key Transactions for Key-Value Stores

Traditional NoSQL systems scale by sharding data across multiple servers and by performing each operation on a small number of servers. Because transactions on multiple keys necessarily require coordination across multiple servers, NoSQL systems often explicitly avoid making transactional guarantees in order to avoid such coordination. Past work on transactional systems control this coordinatio...

متن کامل

Scaling Out to a Single-Node 80Gbps Memcached Server with 40Terabytes of Memory

Current web infrastructure relies increasingly on distributed in-memory key-value stores such as memcached whereby typical x86-based implementations of TCP/IP compliant memcached yield limited performance scalability. FPGA-based data-flow architectures overcome and exceed every other published and fully compliant implementation in regards to throughput and provide scalability to 80Gbps, while o...

متن کامل

بررسی اثر عصاره آبی زعفران برحافظه و یادگیری موش‌های صحرایی نر دیابتی ناشی از تزریق استرپتوزوتوسین (Streptozotocin)

  Background: Diabetes mellitus is accompanied with disturbances in learning, memory, and cognitive skills in the human society and experimental animals. Considering the potential anti-diabetic effect of the medicinal plant Crocus sativus (Saffron) and the augmenting effect of its consumption on the memory and mental health, this study was conducted to evaluate the effect of chronic interaperit...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:
  • CoRR

دوره abs/1802.00696  شماره 

صفحات  -

تاریخ انتشار 2018